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(57) Abstract: A circular transducer array (30) is provided for use in recording a sound field. The array (30) comprises a plurality 
of microphones (31a-3lh), a digital signal processor (33), frequency compensation filters (34) and a sum and difference network 
(35). The digital signal processor calculates the Fourier transform of sampled output signals from the transducers to produce a 
plurality of sound wave components specifying the sound field. The frequency compensation network (34) equalises each component 
using Bassel functions to flatten the apparent response of the array (30) and the sum and difference network (35) then combines the 
equalised components to provide a plurality of audio signals which represent the sound field. 
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MICROPHONE ARRAYS FOR HIGH RESOLUTION SOUND FIELD RECORDING 
FIELD OF THE INVENTION 

5 The present invention relates to an apparatus and method for use in the recording of 
sound fields. In particular it relates to a microphone array and associated hardware for 
producing a plurality of audio signals which represent a sound field to be recorded. The 
apparatus and method can be implemented in surround-sound, stereophonic and 
teleconferencing systems, although is not limited to such use. 

10 

BACKGROUND TO THE INVENTION 



Previous microphones have been developed primarily for use in sound reinforcement 
15 systems and for monophonic and stereophonic recording. Pressure microphones have 
an omnidirectional response, being equally sensitive to sounds arriving from all 
directions. First order gradient microphones were developed to provide a variety of 
directional responses, which can increase the potential acoustic gain in sound 
reinforcement systems in reverberant environments. These microphones also allow 
20 stereophonic recording with acceptable imaging within the loudspeaker angles. The 
gradient microphone is in many cases implemented as two closely spaced pressure 
elements with their outputs subtracted. This produces an approximation to the gradient, 
and a signal proportional to the sound velocity is obtained by integrating the difference 
signal. 

25 

Second order gradient microphones have also been developed which provide greater 
discrimination between sound from different angles of arrival. These typically consist 
of two gradient elements - each often consisting of two pressure elements - which 
produce the second spatial derivative with respect to one, or two axes. A pure second 
30 order response is obtained using the derivative with respect to two axes, and the four 
pressure elements form a square with their outputs combined with amplitudes of plus or 
minus one. This array produces a sin(20) polar response. A second square array is 
obtained by rotating the first by 45 producing a cos(20) response. If the outputs are 
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integrated twice, then at low frequencies the response is constant with frequency. 
Alternative implementations consist of two pressure gradient elements, or a single 
diaphragm open to the atmosphere at four points, with two openings to one side of the 
diaphragm and two openings connected to the other to produce the appropriate signs. 

5 

Higher order devices may also be built using three or more gradient elements and 
similar implementation methods to that of the second order microphones. For each 
order m, an mth order integration is required to produce a flat response with frequency. 

10 An alternative method for improving the discrimination of a microphone is to use two or 
more individual microphones, and to combine their outputs to produce one or more 
outputs which have higher directivity than a single element. More complex systems 
may be built using a larger array of microphones. Typically, prior art examples consist 
of a straight line of microphones with either equal or different inter-microphone 

15 separations, and use beam forming principles to produce one or more beams with sharp 
directivity in one or more directions. 

Surround sound systems offer the potential for improved sound localisation over stereo 
systems. Early quadraphonic systems brought to light some of the issues that affect the 

20 quality of reproduction, in particular the limitations of small numbers of loudspeakers, 
and the importance of the functions used to place individual sound sources in the 360 
degree sound field. The ambisonics system was developed independently by several 
researchers, and has proved to be a low order approximation to the holographic 
reconstruction of sound fields. The sound field is recorded using microphones that 

25 measure the spherical harmonics of the sound field at (theoretically) a point. The 
performance of the system becomes more accurate over wider areas as the number of 
loudspeakers and the number of spherical harmonics of the recorded sound field are 
increased. 

30 All current ambisonics systems are first order: that is, they use a recording microphone 
which records only the zeroth (pressure) and first (x, y and z components of velocity) 
responses. A prior art microphone designed specifically for this purpose is the 
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Soundfield microphone. Since only the first spatial harmonic is available, the resulting, 
reproduction demonstrates poor localisation. 

Most surround systems use only the horizontal (x and y) components of the velocity, 
5 since a) lateral localisation is more acute than vertical localisation, and b) the use of the 
z component requires loudspeakers to be positioned above the listener, which is often 
impractical. In this case the spatial harmonics are obtained from microphones with 
azimuthal polar responses of the form cos(m&) and sin(m0) . Each spatial harmonic 
greater than order zero therefore requires 2 channels. The total number of channels 
1 0 required to transmit or record all spatial harmonics up to order M is thus 2M+1 . 

Modern surround sound systems typically use five loudspeakers, and it has been shown 
that this allows the use of microphones which can measure up to the second order 
spatial harmonics of the sound field, requiring five channels. Surround systems using 
15 more than five loudspeakers will allow harmonics of orders greater than 2, and higher 
numbers of channels are required - for example, the inclusion of third order spatial 
harmonics require seven channels. 

The recently introduced DVD-Audio disk allows the recording of six channels of audio. 

20 It is thus capable of carrying recordings from second order microphone systems. Future 
audio disk technology will provide greater numbers of channels. While some second 
and higher order microphones have been developed in the past, there are currently no 
microphone systems commercially available which can measure spatial harmonics of 
order two or greater. There is thus a technology mismatch between the reproduction 

25 capability that DVD disks offer and the recording technology that current microphones 
can provide. A practical need therefore exists for the development of microphone 
systems that can accurately record the higher spatial harmonics of sound fields in the 
horizontal plane, and in particular, the second order responses. 

30 Consider a general sound pressure field p(x,y,z,t). The pressure in the plane z=0 is a 
three-dimensional function of x f y and This three-dimensional function may be 
equivalently expressed in terms of its three-dimensional Fourier transform 



t 
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where £is the vector wavenumber and (-jlc P) is chosen so that the pressure is 
represented by incoming waves which is relevant in surround systems, as opposed to 
5 outgoing waves in some texts. This equations shows that any sound field in the 
horizontal plane z=0 can be expressed as a sum of plane waves. 

Writing )c in terms of its two components u = &cos(0) and v = ksm{9) , where k = 
this may be written 

10 

As an example, a complex plane wave with radian frequency coo, magnitude B, phase <|> 
and angle of incidence 0 O has the form 



p(x y t) = Be^**^""^** 9 **^** (3) 

15 

where k 0 = a> 0 1 c and c is the speed of sound. The Fourier transform is 

P{u, v, a)) = A{2ny 6{u - k Q cos(0 o ))s{v - k 0 sin(0 o ))5{co - a> Q ) ( 4 ) 

where, for convenience, A = Be j * is the complex amplitude. The "spectrum" consists 
20 of a delta function at <o = o) 0 , u = k cos(d 0 \v = k sin(# 0 ) . Since P(u, v, co) exists only at 
one point, it may be represented as a vector 10 in wavenumber-frequency space 1 1 , as 
shown in Figure 1. In the (w,v) plane, the vector 10 has a projection 12 which is a 
vector of radius k 0 = & Q Ic and angle 0o relative to the u axis. 



25 



A real plane wave is given by the real part of equation 3, 



* 
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which can be written 



(6) 



The second term consists of a negative frequency complex plane wave with conjugate 
phase and the same positive wavenumber k 0 propagating in the opposite direction 
# 0 + 7t . The spectrum may be represented as two vectors in (w, v,a>) space. As a) 0 and 
0 O vary, the two vectors trace out a cone shape, since k^co/c . Thus the spectrum of 
any two-dimensional spatial pressure field lies in the cone <o = ±ck in the three- 
dimensional (u,v,cd) space. 

The pressure field is obtained from P(u, v,<y) by the inverse Fourier transform 

Writing i>(w,v,<y) in terms of spatial polar coordinates, u = &cos(0), v = £sin(#), and 
p(x,y,t) in terms of polar coordinates x = rcos(^), y = rsin(^) yields 

Since k = co/c the integral over <y is only nonzero for <y = ±Jtc . Hence 



P{k, 0, a) = 0, co)l7t[s{a) - *c) + <?(<y + kc)] 



(9) 



.1) I «• ' 
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and so 



+ ^rf f P{k,6-kc)e^- c '^ e -^kdkd9 

There are two special cases of interest. In the first, the signal contains only positive 
5 frequencies, (for example the complex plane wave considered above) and the pressure 
field is analytic. In this case the second integral is zero, and the analytic pressure field 
is 

10 The analytic case is useful for the analysis and design of surround systems. 

The second case of interest is real pressure fields, which occur in practice. In this case 
the spectrum in polar coordinates has the property 

P(k, 0,-kc) = P*{k,0 + 7r,kc) (12) 

15 

Substituting this in equation 1 0 



p R ( r> ^) = _L_r f* R e {p(k,9,kc)e Jk ^ rm ^}kdkd0 ° 3) 



Equations 1 1 and 13 both show that the pressure field is completely specified by a two 
20 dimensional spectrum S(k,0) = kP(k,0,kc) which specifies at each frequency, the 
complex amplitude of the plane wave arriving from each angle 0. S(k,0) may be 

termed the frequency-dependent source distribution. Since it is periodic in 8, it can be 
expanded in a Fourier series 
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Ef.frV - (14) 

The coefficients q m {k) are thus the "angular spectrum" of S(k,&) at each spatial 
frequency k , given by 

5 

The analysis is further simplified by examining each frequency component separately. 
In this case the sound field is "monochromatic", consisting of complex plane waves of 
the same frequency o> 0 arriving from all directions 8. In this case 

10 

P(k, 0, kc) = \S{K6) = k 0 )S Q (0) (1 6) 

where S 0 ($) = S(k 0 , 6) . Substituting this in equation 1 1 yields 

1 5 Thus a monochromatic sound field is expressed in terms of its one-dimensional source 
distribution. A simple example is a single plane wave with complex amplitude A 
arriving from direction 0 O . The source distribution is a delta function at 0 = 0 O and 
thus 



20 

and so the angular spectrum is 
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q m =Ae- J " e - (19) 



The monochromatic sound field may be written directly in terms of the spectrum of 
$0(0) °y substituting from equation 14, 



p Q {r,*,t) = e»« J g±[' ( 20 ) 

5 

which, with the identity 



i»j( z ) = ±t" e A»°+~os(e)) dd (21) 
In * 



yields 



10 



(22) 



This shows that the angular pressure field at radius r may be written as a sum of terms 
of the form exp(ym^). These have been termed "phase modes" in antenna array 
literature and the same terminology will be used here. The magnitude of each phase 
15 mode is the spectral coefficient multiplied by a Bessel function of the first kind which 
describes how the phase mode varies radially. 

An important feature of equation 22 is that for small k a r the Bessel functions of high 
orders are small and may be neglected without significantly affecting the pressure. 
20 Hence, for low frequencies, or for small radii, the phase mode expansion may be 
truncated to some maximum order m = ±M . However, as the frequency or radius 
increases, M must increase to preserve the accuracy of the expression. 



9 

As an example, the pressure due to a single plane wave at angle 0 Q is obtained from 
equations 19 and 22 with q m = Aexp(- jm0 Q ) 

m& -CO 

5 Thus the pressure field due to a plane wave consists of phase modes with magnitudes 
given by Bessel functions. 

By adding the terms in equation 22 m = 1 and m = -l, and noting that 
J-m [z) - (- l) m J m (z), the phase mode expansion may be written 

10 



Po 



r °» (24) 

yZy w k-^K(V)sinM] 



+ 1 

m\ 



Thus the pressure may be alternatively written as a sum of cosine and sine terms, which 
are known as amplitude modes. In cases where the spectrum of S(d) is Hermitian 
- 9* m )> can be written 

15 

[(25) 



-2Xy w Im{ 9 Jy w (* 0 r)sin(m^)] 



The spectrum of the plane wave (equation 19) is Hermitian, and substituting for q n 
yields the simpler and well-known form 



p 0 (r, * f ) = ^ [j 0 (At) + 2]T y V. (V)cos(m(0 o - *))] 



(26) 



WO 01/58209 




PCT/NZ01/00010 



10 



SUMMARY OF THE INVENTION 

It is an object of the invention to provide an apparatus and/or method for use in 
recording sound fields. In general terms the invention is directed towards a transducer 
5 array and associated hardware for producing an audio signal which represents a desired 
sound field. 

In one aspect the present invention may be said to consist of an apparatus for use in 
recording a sound source including:an array of transducer elements arranged about a 
10 point each of which produces an output signal in response to one or more incident sound 
waves from the source, and signal processing hardware which generates a plurality of 
audio signals representing the sound field using each transducer output signal. 

Preferably the microphones are cardioid microphones arranged to face radially 
15 outwards. Alternatively the microphones may be any type of omnidirectional or 
directional microphone. 

Preferably the compensation network includes a Bessel function based compensation 
function. 

20 

Preferably the output of the compensation network has an azimuthal angular response of 
the form e tJme or cos(ro#) or sin(m#) for m=0 to m=M, where M is the number of 
spatial harmonics calculated and 9 is the angle of incidence defined from some 
reference angle. 

25 

In another aspect the present invention may be said to consist of an apparatus for 
producing an audio signal representing a sound source including: a circular array of 
omnidirectional microphones for receiving one or more sound waves from the source,a 
digital signal processor for calculating a Fourier transform from the microphone outputs 
30 at sample times, one or more filters for equalising each component of the Fourier 
transform, and a network for combining the components into a plurality of audio 
signals. 
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In another aspect the present invention may be said to consist of an apparatus for 

producing an audio signal representing a sound source including: a circular array of 

cardioid microphones for receiving one or more sound waves from the source, 

a digital signal processor for calculating a Fourier transform from the microphone 

outputs at sample times, one or more filters for equalising each component of the 

Fourier transform, and a network for combining the components into a plurality of audio 

signals. 

In another embodiment the present invention may be said to consist of a method for 
recording a sound source including: sampling sound waves from the source at a 
plurality of locations, and signal processing the samples to produce a plurality of audio 
signals representing the sound field, wherein the waves are sampled at locations which 
are arranged about a point. 

Preferably the present invention provides a microphone array which can measure a 
plurality of spatial harmonics of a sound field in the horizontal plane, with polar 
responses that are substantially constant with frequency, and which avoid the difficulties 
that other microphones produce. The array processing is based on the Fourier transform 
combined with particular forms of frequency compensation, and yields circular phase 
and amplitude modes, which cannot be determined from existing systems. 

In a possible embodiment spherical harmonics are produced by an array with N 
elements, up to a maximum number M = (N/2-l) for M even, and A/ = (W-l)/2 
for N odd. An equalisation function is then used which extends the useable frequency 
response of the array over prior art arrays which use integrators. In this embodiment 
first order directional elements may be used in the array which eliminates zeros in the 
frequency responses of the array, further extending the frequency range over prior art 
systems. Such an embodiment can also simplify the construction process in comparison 
to existing microphone array apparatus. 
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BRIEF LIST OF FIGURES 

A preferred form of apparatus and method of the invention will be further described 
with reference to the accompanying drawings by way of example only and without 
5 intending to be limiting, wherein: 

Figure 1 shows a vector of a complex plane wave; 

Figure 2 shows prior art second order microphones based on two quadrapole arrays; 

10 

Figure 3 A shows a microphone array of omnidirectional microphones; 

Figure 3B is a block diagram illustrating the processing steps for the microphone 
outputs; 

15 

Figure 4 is a graph of the cosine response of a prior art quadrapole microphone; 
Figure 5 is a graph of the cosine response of a second order DFT microphone; 
20 Figure 6 is a graph of the cosine response of a second order DFT microphone; 
Figure 7 is a graph of the cosine response of a second order DFT microphone; 
Figure 8A shows a circular microphone array of cardioid microphones; 

25 

Figure 8B is a block diagram illustrating the processing steps for the microphone 
outputs; 

Figure 9 is a graph of the cosine response of a quadrapole microphone array using 
30 cardioid microphones; 
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Figure 10 is a graph of the cosine response of a second order DFT microphone array 
using cardioid microphones; 

Figure 1 1 is a graph of the cosine response of a second order DFT microphone array 
5 using cardioid microphones; 

Figure 12 is a graph of the required compensation for a second order DFT cardioid 
microphone system; 

10 Figure 13 is a graph of the cosine response of a third order DFT microphone array with 
cardioid elements; and 

Figure 14 is the required compensation for the third order DFT microphone array. 

1 5 DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Figure 2 shows an existing array 20 comprising two prior art second order microphones 
21, 22 based on two quadrapole arrays. These microphones 21, 22 typically consist of 
two gradient elements - often each consisting of two pressure elements. The system 

20 produces the second spatial derivative with respect to one or two axes. The closed 
circles 1, 2, 3 and 4 represent the first second order microphone 21 and the open circles 
5, 6, 7 and 8 represent the second second order microphone 22 . The second order 
microphone 22 represented by the open circles produces a s\n(29) polar response and 
the second order microphone 21 represented by the closed circles produces a cos(29) 

25 polar response. Together these two microphones 21, 22 produce the second spatial 
harmonic as described by the Fourier series when their outputs are combined as shown 
by the +1 and -1 beside each circle. 



One embodiment of the invention 30, 32 shown in Figures 3a and 3b provides improved 
30 frequency response of a microphone array over existing arrangements. The background 
theory has shown that the sound pressure over a given region is equivalently described 
by the two dimensional source distribution S(k,0) = P(k,0 y kc). Equation 22 provides a 
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way to determine the spectral coefficients of S Q (0) from p(r,fi,t). The pressure is 
itself a periodic function of 0, and therefore has Fourier coefficients z m given by 

z m =^l'p(rj,t)e-^ (27) 



5 Substituting from equation 22 yields 



(28) 



= e Ja *TJ m (k 0 r) qm 



by orthogonality of the phase modes. Hence 



4* = ^rr-r, r f P<rJ,t)e'*«dt 



(29) 



2xj m J m (k 0 r) 
10 

Thus the spectral coefficients of the source distribution may be obtained from the 
Fourier transform of the pressure on a circle, equalised by Bessel functions. 

In practice, the recording is carried out using a discrete circular array of omnidirectional 
15 microphones, so that the pressure field is sampled. We now consider the effects of this 
sampling on the continuous case. 

The sampling that occurs using a discrete array of microphones can be taken into 
account by multiplying the pressure p(r, <f> t t) by a train of delta functions of the form 

20 



= iy w > 



I2n 

' N 



(30) 
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The second equivalent form will be useful for examining the aliasing caused by 
sampling. 

The microphone array response s m (t) formed by substituting the delta function train into 
5 equation 27 is 




which is the DFT of the samples of the pressure at N equally spaced angles. If the 
second form of the sampling function is inserted, the result is 

10 

*.('>- Zy-f PinWW (32) 

CO 

This form shows that the discrete array produces the sum of the [m-lN] phase modes 
obtained from the continuous integral (equation 27). The mth mode is the desired one 
and those for 1*0 are aliases. This equation is useful because it shows that the discrete 
15 array responses can be determined directly from the continuous integral in equation 27. 

Substituting for z m from equation 28 and q m from equation 19 yields the response of the 
discrete array to a complex plane wave from direction 0 0 



(33) 



20 



This expression shows the alias phase modes explicitly, and may also be derived 
directly from the discrete sum in equation 31. For low frequencies or small radii, the 
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1=0 term dominates, yielding the complex sinusoidal signal multiplied by the mth phase 
mode of the plane wave 



s m (/) = Aj m J m (kr)e j6 »e- Jm$ ° (34) 

5 However, at higher frequencies higher order aliases will begin to be significant, 
introducing unwanted sidelobes into the mth polar response. For cases where the aliases 
are small, the array output must be equalised by a function 



em— — (35) 



10 in order to produce a response which is constant with frequency. The equalisation may 
be carried out up to the frequency where J m (kr) is equal to zero. At this point the 
equalisation function is infinite. This marks the upper frequency limit of the array. The 
frequency range is therefore specified by the array radius r, with smaller radii allowing 
a wider frequency range. 



15 



20 



The circular array with DFT processing is a generalisation of the prior art quadrapole 
microphones 1 1, 12 shown in Figure 1. This may be shown as follows: The amplitude 
mode responses for a plane wave input may be determined from equation 3 1 



(36) 



and 



(27dm) (37) 



From equations 36 and 37 it is apparent that for N=8 and m=2 the cosine mode uses 
25 only the 0, 2, 4, 6, elements since cos(nn/2) is zero for odd n. The signs for the non- 
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zero elements are (-1) . Similarly the sine mode response is zero for the even elements 
and the signs are (-\) {n ' I)/2 . The 8 element array with DFT weightings thus produces the 
same responses as the two quadrapole microphones in figure 1. Higher numbers of 
elements also produce circular arrays with amplitude weightings of ±1. For example an 
5 N=J2 element array produces two hexagonal arrays with alternating sign weightings for 
aw=5 and these two arrays produce cos(36) and sin(36) polar responses. In general, the 
DFT approach produces all circular multipole arrays for N=4m, but also allows the 
implementation of a greater number of responses using other numbers of microphones, 
and with complex amplitude weightings. 

10 

Figure 3 A shows a circular microphone array 30 of 8 omnidirectional microphones 31a 
to 31h. The microphones 31a to 3 lh are evenly spaced around a circle of uniform 
radius. These microphones receive sound from all directions equally and cannot 
individually distinguish the direction of origin of a sound wave. A sound wave 39 

15 arrives at the microphone array at angle do. This sound wave is detected by all the 
microphones 31a to 31 h. The outputs of the microphones are passed to an equalisation 
network. Figure 3B shows the processing blocks 32 used to equalise the outputs of the 
microphones 31a to 31h to produce the best frequency response. The outputs of the 
microphones 3 la to 3 lh are first processed in an N-point DFT block 33 before passing 

20 through a frequency compensation network 34 containing a Bessel function based 
equaliser function. Following this the signals pass through a sum and difference 
network 35 to produce amplitude node responses. The output of the sum and difference 
network 35 is in terms of the spatial harmonics of the microphones 31a to 31 h. 

25 The DFT block 33, frequency compensation network 34 and sum and difference 
network 35 may be readily implemented by those skilled in the art based on the 
explanations of the nature of the array disclosed in this specification. The frequency 
compensation network 34 may utilise FIR or IIR filters. 

30 The DFT array 30 allows a number of harmonics to be measured from a single array, up 
to (in principle) the positive Nyquist value 
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M = 



N 1 M 
1 Yv even 

2 

Nodd 



(38) 



Figure 4 shows as a solid line the unequalised cosine response 42 of the prior art 
quadrapole with cos (2 6) polar response, for a plane wave field arriving from 0=0 
degrees and an array radius of 50mm. The lowest order response 40 (equation 34) is 
5 shown dash-dotted. The lowest order response 40 is equal to the actual output of the 
discrete array up to about 3 kHz, above which the first alias term begins to be 
significant. The response 41 of a second order differentiator is shown dashed. This is 
the response that would be perfectly equalised by a prior art second order integrator, and 
is the low frequency approximation to the Bessel function. At low frequencies (less 

10 than about 1 kHz) the integrator will produce a constant output with frequency, but at 
higher frequencies the integrator output will begin to reduce. Using the lowest order 
Bessel function equalisation extends the quadrapole response up to 3 kHz, and including 
the first alias will further extend the frequency range. At 6.8 kHz, the array output is 
zero, and equalisation is not possible, and so the upper frequency limit is in the region 

15 of 6 kHz. Using a smaller array radius will produce a higher frequency limit, but the 
low frequency equalisation gain will become larger. This is the classical trade-off in 
microphone design that typically requires the microphone elements to be close together 
to produce a wide frequency range, or the use of two-way designs. 

20 Figure 5 shows as solid lines the unequalised frequency responses 50, 51, 52 of the 
second cosine amplitude mode produced by a DFT array with N=7 elements arranged at 
a radius of 50mm for input angles 0, 22.5 and 45 respectively. The lowest order 
responses 53, 54 that would be obtained using a continuous array are shown dash-dotted 
for each angle. The ideal response is zero for 45 degrees but the actual responses 50, 

25 51,52 rise above 2 kHz due to the higher order aliases. 



Figure 6 shows as solid lines the unequalised second order cosine responses 60, 61, 62 
of an N=8 DFT array with a radius of 50 mm and input angles 0, 22,5 and 45 
respectively. The lower order responses 63, 64, 65 are^ shown in as a dash dotted line. It 
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has the same form as the quadrapole response in figure 4, as expected. The N-7 (of 
figure 5) responses are closer to the lowest order responses than the N=8 responses, 
possibly because they use all the microphone elements, but the 45 degree response is 
not zero as it is for the N=8 case. However, the actual 0 (60) and 22.5 (61) degree 
5 responses in figure 6 produce zeroes at higher frequencies, making equalisation 
impossible above around 5 kHz for 0=22.5 degrees and around 6.5 kHz for 0 degrees. 

An important advantage of the DFT approach is that if a higher number of microphone 
elements are used, the aliasing terms are pushed higher in frequency. This is a well 

10 known property of sampling theory. It is demonstrated in equation 33, which shows 
that the next two higher Bessel functions after the mth have orders N-m and N+m. 
Thus, for m=2 and N=8 the first alias has order 6 and the second has order 10. Using 
N>8, however, results in reduced aliasing. For example, with N=12 microphones, the 
first alias magnitudes are Jiofkr) and Ju(kr). The cosine amplitude mode response 70 

15 with 12 elements is shown in figure 7 with &=0 degrees and a 50mm array radius. The 
lowest order response 71 is identical to that of the quadrapole response in figure 4, but 
the actual response is now equal to the lowest order response 71 up to about 7 kHz, as 
opposed to only 3 kHz for the zero degree response in figure 6. This shows that the 
higher order aliases are less significant. Thus, for sufficiently large numbers of array 

20 elements, equation 35 is the correct equalisation function over the entire useable 
frequency range. 

The analysis above assumes a complex plane wave input. In practice the sound pressure 
is a real function, and each positive frequency is associated with a negative counterpart. 

25 The DFT array response is thus the sum of the positive and negative frequency 
responses. Putting k~k in equation 35 and noting that J m (-z) ~ (-l) m J m (z) shows that 
the equalisation filter response for the negative frequency is the conjugate of the 
positive frequency value. Hence the equalisation filter transfer function is Hermitian 
and the impulse response is therefore real. The processing for real pressure signals is 

30 therefore unchanged. The DFT processor produces complex outputs for each phase 
mode, ie two signals representing the real and imaginary components. Both 
components are then filtered by the real equalisation filter to produce frequency 
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independence. The complex phase mode signals may then be combined to produce real 
amplitude mode outputs. 

Another, preferred, embodiment of the invention is shown in figures 8a and 8b which 



discussed so far produce zeros in the frequency response where equalisation is not 
possible. However, this problem may be avoided by constructing an array 80 using first 
order directional microphone 81a-81h. The output from the array 80 can be equalised 
using signal processing hardware 82 comprising a DFT 83, frequency compensation 
10 filters 84 and a sum and difference network 84. Each directional microphone element 
81a-81h has a response: 



Each microphone element 81a to 8 lh has its main lobe "looking outward" (radially) 
15 from the array centre, as shown in figure 8a. The first order microphone consists of the 
combination of a pressure and velocity response, and so the array response may be 
determined as the sum of the pressure response for a complex plane wave, determined 
in the previous section (equation 28), and the velocity response 



Applying the sampling function to this integral again shows that the discrete array 
response consists of a sum of the m^lN phase mode responses. Therefore we need 
consider only the continuous integral. 

25 The 1=0 velocity response is found using 



5 also provides an apparatus with improved frequency response. The microphone arrays 



p„(0) = a + (l-a)cos(0-0 n ) 



(39) 




(40) 
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d 



(41) 
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and is 



Vo (0 = Ae J ^r x f m (kr)e- Jme (42) 

where oT(fo-)is the derivative of J m (kr), and hence the array responses using N 
outward-facing velocity microphones are 

* M co - ir-'v: WA ,(Ar) e -><-'^ (43) 

Adding the pressure (33) and velocity (43) responses according to (39) yields 

V.(0 = ^ Zy"-'1^. w (Ar)-y(l-a)/;. W (*r)k y(w - W)fl (44) 

/ts-<0 

The ideal first order element responses (7=0) are thus 

V. (0 = k- (*r) - 7(1 - «)/. 0)]^ (45) 
which requires the equalisation function 



(46) 



aI m {kr)~j\\-a)J' m {kr) 
In practice the derivative of the Bessel function may be determined from the identity 



j, (z ^ J„-M-J m M (47) 



Equation 46 shows that the problems with the zeros of J m (kr) are removed. Since the 
derivative of the Bessel function is zero at different points, the sum of the two is non- 



WO 01/58209 




PCT/NZ01/00010 



22 

zero for all frequencies. However, the actual array response (including aliases) only 
produces non-zero magnitudes for suitably large M 

The unequalised response 90 of a cardioid array of radius 50mm with N-8 elements 
5 (the quadrapole case), a=0.5 (cardioid) and 9=0 degrees is shown in figure 9. The 
lowest order response 91 has no zeros, but the discrete array still produces zeros in its 
response. The cosine amplitude response 100 magnitude for N=12 cardioid array of 
radius 50mm with a=0.5 (cardioid) and 0=0 degrees is shown in figure 10. The actual 
response now follows the lowest order response 101 up to a frequency of about 6 kHz as 

10 opposed to 3 kHz for the quadrapole. More importantly, the reduction of aliases has 
produced a response with no zeros. This means that the frequency compensation can be 
carried out over a wide bandwidth with no difficulty. The cardioid element produces 
the lowest variation in frequency response. This is because each element has its null 
pointed at the opposite side of the array, which minimises comb filtering caused by 

15 wavefronts crossing from one side of the array to the other. 

As a more practical example, consider an array of 16 cardioid elements with radius 
30mm. The uncompensated cosine response 110 for an input angle of zero degrees is 
shown in figure 1 1 along with the low order response 1 1 1 and the required magnitude 

20 compensation 120 in figure 12. The DFT array response is non-zero over the entire 
audio bandwidth, and this is true for all angles, with a cos (20) weighting of the 
response. Furthermore, the compensation gain variation is considerably less than would 
be required for a prior art quadrapole using two integrators. This is because the mth 
harmonic response using directional elements introduces a Bessel function of order m-1 

25 (equation 47), which has a greater amplitude at low frequencies. A double integrator 
reduces by 40dB per decade, requiring 120dB gain variation from 20 Hz to 20kHz. The 
example in figure 12 demonstrates only 45dB variation, reducing low frequency noise 



problems. 



30 



Finally, the third order uncompensated cosine response 130 for N=16, R= 30mm input 
angle of zero degrees along with the low order response 131 is shown in figure 13 and 
the required compensation gain 140 in figure 14. The response is still well-behaved, 
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and the gain variation is now around 95 dB, which is less than the 180dB which would 
be required for a closely spaced six element multipole using three integrators. 

The frequency magnitude and phase compensation of the DFT responses produces - 
5 ideally - flat responses with linear phase. The compensation filters are inverse filters 
that compress the dispersive impulse responses produced by the array and DFT 
processing back to the ideal impulse response, retaining the required angle dependence 
of the amplitude. This means that coincident microphones are not required. Surround 
sound recordings may thus be made using standard, high quality directional 
1 0 microphones and FFT and digital filter post-processing techniques. 



Finally, a circular array may also be useful in areas of application other than surround 
sound systems, such as teleconferencing systems. Surround reproduction may be 
carried out using techniques such as ambisonics. Even if other reproduction methods 

15 are used, the circular microphone array is still useful for discriminating between 
speakers over 360 degrees. The directivity of a circular array is not as high as that of a 
linear array, which - for similar inter-element spacings — has an aperture of about it 
times that of the circular array. However, the circular array offers beam patterns that can 
be rotated around 360 degrees without the variable beam widths that occur in linear 

20 arrays, and may be placed for example in the centre of a table. Furthermore, since the 
amplitude mode responses are independent of frequency, the circular array can provide 
beam patterns that are constant with frequency, avoiding the high frequency roll-off that 
can occur with standard linear arrays. 

25 The descriptions given herein are not intended to be restrictive, and other 
implementations or examples of the generic forms derived will be understood by those 
skilled in the art. 
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Claims: 

1 . An apparatus for use in recording a sound field including: 

an array of transducer elements arranged about a point each of which produces 
5 an output signal in response to one or more incident sound waves from the field, and 

signal processing hardware which generates audio signals representing the sound 
field using each transducer output signal. 

2. An apparatus according to claim 1 wherein the hardware generates an audio 
10 signal with a characteristic such that the apparent frequency response of the apparatus is 

flattened over at least a portion of an audio bandwidth. 

3. An apparatus according to claim 1 wherein the hardware includes: 

a digital signal processor for calculating a Fourier transform of the output signals 
1 5 from the transducers to specify the sound wave as a plurality of components, 

one or more filters for equalising each component to flatten the response over at 
least a portion of the audio band, and 

a network to combine the equalised components into an audio signal. 

20 4. An apparatus according to claim 3 wherein the components are spatial 
harmonics of the sound field. 

5. An apparatus according to claim 4 wherein the one^or more filters equalise the 
components using a function based on one or more Bessel functions and derivatives of 

25 Bessel functions. 

6. An apparatus according to claim 5 wherein the array is a substantially circular 
arrangement of substantially equally spaced transducers. 

30 7. An apparatus according to claim 6 wherein the Bessel functions are selected 
based on components which contribute significantly to the magnitude of the sound 
wave. 
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8. An apparatus according to claim 7 wherein the portion of the audio band over 
which the Bessel functions and derivatives equalise the response can be extended by 
reducing the significance of higher order components. 

5 

9. An apparatus according to claim 8 wherein the significance of higher order 
components can be reduced by increasing the number of transducers comprising the 
array. 

10 10. An apparatus according to claim 6 wherein the significance of higher order 
components can be reduced by reducing the radius of the array. 

11. An apparatus according to claim 10 wherein the portion over which the 
frequency is flattened is extended to substantially the entire audio band by using 

15 transducers which are first order microphones. 

12. An apparatus according to claim 1 wherein each transducer is an omnidirectional 
microphone. 

20 13 An apparatus according to claim 1 wherein each transducer is a cardioid 
microphone. 

14 An apparatus according to claim 1 wherein there are at least 8 transducers in the 
array. 

25 

15. An apparatus for producing audio signals representing a sound field including: 

a circular array of omnidirectional microphones for receiving one or more sound 
waves from the field, 

a digital signal processor for calculating a Fourier transform from the 
30 microphone outputs at sample times, 

one or more filters for equalising each component of the Fourier transform, and 
a network for combining the components into the audio signals. 
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16. An apparatus according to claim 15 wherein the Fourier transform of the mth 
output of the array of microphones is specified by: 

s m {t) = Ae J " ir^J^^e'^-^ 
/=-« 

5 

where s m (t) is the unequalised response of the microphone array, m is the mode of the 
array, N is the number of microphones, A is the amplitude of an incident sound wave 
from the field and 6q is the angle of the sound wave. 

10 17. . An apparatus according to claim 16 wherein the response of the array to low 
sound wave frequencies can be approximated by: 

s m (t) = Aj m J m (kr)e J *»e- Jm »° 

18. An apparatus according to claim 17 wherein the one or more filters equalise the 
15 approximate response by implementing the function: 

1 ' j m J m {kr) 

19. An apparatus according to claim 18 wherein the upper sound wave frequency at 
which the Fourier transform is equalised can be increased by increasing the number of 

20 microphones in the array. 

20. An apparatus according to claim 1 8 wherein the upper sound wave frequency at 
which the Fourier transform is equalised can be increased by reducing the radius of the 
array. 

25 

21 . An apparatus for producing audio signals representing a sound field including: 
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a circular array of first order microphones for receiving one or more sound 
waves from the field, 

a digital signal processor for calculating a Fourier transform from the 
microphone outputs at sample times, 
5 one or more filters for equalising each component of the Fourier transform, and 

a network for combining the components into the audio signals. 

22. An apparatus according to claim 21 wherein the approximate Fourier transform 
of the mth output of the array of microphones in response to low sound wave 

10 frequencies is specified by: 

where s m (t) is the approximate unequalised response of the microphone array, m is the 
mode of the array, N is the number of microphones, A is the amplitude of an incident 
15 sound wave from the field and 0 O is the angle of the sound wave. 

23. An apparatus according to claim 22 wherein the one or more filters equalise the 
approximate response by implementing the function: 



EM)-- J 



20 



24. An apparatus according to claim 23 wherein the upper sound wave frequency at 
which the Fourier transform is equalised can be increased by increasing the number of 
microphones in the array. 



25 



25. An apparatus according to claim 24 wherein the upper sound wave frequency at 
which the Fourier transform is equalised can be increased by reducing the radius of the 
array 
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26. An apparatus according to claim 25 where a is set to 14 to produce cardioid 
elements. 

27. A method for recording a sound field including: 

5 sampling sound waves from the field at a plurality of locations arranged about a 

point, and 

signal processing the samples to produce a plurality of audio signals 
representing the sound field, 

10 28. A method according to claim 27 wherein the samples are processed in a manner 
to produce audio signals with characteristics such that the apparent response of sampled 
sound waves for a range of wave frequencies is flattened. 

29. A method according to claim 28 wherein processing the samples includes 
15 equalising sample components to flatten the apparent frequency response of sampled 

sound waves for a range of wave frequencies. 

30. A method according to claim 29 wherein the samples are equalised using 
functions based on one or more Bessel functions and derivatives of Bessel functions. 

20 

31. A method according to claim 30 wherein the range of wave frequencies over 
which the response is flattened can be extended by sampling the sound waves at more 
locations. 

25 32. A method according to claim 30 wherein the samples are taken at substantially 
evenly spaced locations about a circle. 

33. A method according to claim 32 wherein the samples are taken from the output 
of transducers placed at each location. 

30 
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34. A method according to claim 32 wherein the range of wave frequencies over 
which the response is flattened can be extended by reducing the circumference of the 
circle. 



35. A method according to claim 34 wherein the range of wave frequencies over 
which the response is flattened can be extended to substantially the entire audio 
bandwidth by using transducers which are first order microphones. 
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FIGURE 3a 



M 0 o- 



M N ., o- 



N Point 
FFT 



33' 



Frequency 

Compensation 

Filters 



Omni 


Sum and 


exp(j9) 




exp(-j0) 


• 
* 


Difference 


expOM9) 


Networks 






exp(-jM8) 



34' 



35 



> Omni 

> cos(G) 

> sin(8) 



> cos(M9) 
>sm(M6) 



32 
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FIGURE 4 
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FIGURE 8a 
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